An Improved Thresholding Function and Sparse Subspace Decompo- Sition for Speech Enhancement and Its Application to Speech Recognition

نویسندگان

  • Mohamed anouar Ben messaoud
  • Aïcha Bouzid
چکیده

Kurzfassung: In this work, we propose an unsupervised monaural Arabic speech enhancement method that is based on two different techniques. The main idea is to determine an exact threshold value in the wavelet domain depending on the voicing state of the Arabic speech signal. Our proposed voiced/unvoiced decision algorithm based on the Multi-scale Product (MP) analysis is used. The MP is based on the multiplication of wavelet transform coefficients at three successive dyadic scales. Then, we apply a denoising technique based on the thresholding of the discrete wavelet transform coefficients. The threshold values change either when the signal is voiced or unvoiced. Further, a subspace decomposition-based post-processing technique is implemented. The Fast Fourier Transform (FFT) of the obtained frames is decomposed into three subspaces: sparse, low rank, and the remainder noise components. Experimental results show that the proposed approach outperforms the compared speech enhancement methods for noise-corrupted Arabic speech at low levels of SNR. Beside, we present the evaluation results for automatic recognition on enhanced Arabic speech signal. We reconstitute the clean Arabic speech from noisy observations based on a sparse imputation technique. It employs a non-parametric model and finding the sparsest combination of exemplars that jointly approximate the reliable features of a noisy Arabic utterance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain

Quality of speech signal significantly reduces in the presence of environmental noise signals and leads to the imperfect performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, the single channel speech enhancement of the corrupted signals by the additive noise signals is considered. A dictionary-based algorithm is proposed to train the speech...

متن کامل

Speech Enhancement Through an Optimized Subspace Division Technique

The speech enhancement techniques are often employed to improve the quality and intelligibility of the noisy speech signals. This paper discusses a novel technique for speech enhancement which is based on Singular Value Decomposition. This implementation utilizes a Genetic Algorithm based optimization method for reducing the effects of environmental noises from the singular vectors as well as t...

متن کامل

Speech Enhancement Through an Optimized Subspace Division Technique

The speech enhancement techniques are often employed to improve the quality and intelligibility of the noisy speech signals. This paper discusses a novel technique for speech enhancement which is based on Singular Value Decomposition. This implementation utilizes a Genetic Algorithm based optimization method for reducing the effects of environmental noises from the singular vectors as well as t...

متن کامل

Speech enhancement based on hidden Markov model using sparse code shrinkage

This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on the independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a Maximum a posterior (MAP) estimator based on Laplace-Gaussian (for clean speech and noise respectively) combination in the HMM ...

متن کامل

Speech Enhancement using Adaptive Data-Based Dictionary Learning

In this paper, a speech enhancement method based on sparse representation of data frames has been presented. Speech enhancement is one of the most applicable areas in different signal processing fields. The objective of a speech enhancement system is improvement of either intelligibility or quality of the speech signals. This process is carried out using the speech signal processing techniques ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017